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^ , Abstract 

The branching program is a fundamental model of nonuniform computation, which conveniently captures both 
time and space restrictions. In this paper we prove a quadratic expected time-space tradeoff of the form TS = 
O(y) for q-v/ay deterministic decision branching programs, where q ^ 2. Here T is the expected computation 
i— i • time and S is the expected space, when all inputs are equally likely. This bound is to our knowledge, the first such 

to show an exponential size requirement whenever T = 0(n 2 ). Previous exponential size tradeoffs for Boolean 
decision branching programs were valid for time-restricted models with T = o(nlog 2 n). Proving quadratic time- 
^ ' space tradeoffs for unrestricted time decision branching programs has been a major goal of recent research - this 

O ■ goal has already been achieved for multiple-output branching programs a few decades ago. The decision branching 

programs we consider are related to families of good linear codes. 
— . Our results also imply the first quadratic time-space tradeoffs for Boolean decision branching programs verifying 

t> ' circular convolution, matrix-vector multiplication and discrete Fourier transform. A quadratic tradeoff is the largest 

possible for all these problems. Using the constructive family of Justesen codes which are asymptotically good, 
we also demonstrate a constructive Boolean decision function which has a quadratic expected time-space tradeoff 
qq . in the Boolean deterministic decision branching program model. 

' For (j-way programs where q is a constant, the tradeoff results derived here for decision functions verifying 

\& . various functions are order-comparable to previously known tradeoff bounds for calculating the corresponding 

multiple-output functions. In deriving these bounds we use several bounding techniques and introduce a few 
new ideas. These include a particular measure of progress which is specific to the decision function considered, 
partitioning the computational paths into disjoint sets and obtaining tradeoffs for each class separately and extensive 
use of linear constraints to obtain probability bounds. 
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I. Introduction 

The branching program is a fundamental model of nonuniform computation, which conveniently 
captures both time and space restrictions. See Section II- B for a formal definition. Proving lower bounds 
on computational resources (such as time and space) required to compute a specific (explicit) function in 
a general computational model is a notoriously difficult task. Topics of interest in this direction are both 
the bounds themselves as well as the methods for obtaining these bounds. 

There are many different types of branching programs, and it is important to distinguish between them. 
The first major distinction, between decision (single-output) branching programs and multi-output branch- 
ing programs, has to do with how the output of the program is produced. In the case of decision branching 
programs the output is a single bit: each sink node is labeled by either or 1, and the output of the 
program is simply the value labeling the sink node reached. In the case of multi-output branching pro- 
grams, the output is a sequence of m > 1 values, and each node in B is allowed to assign (write) at most 
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one of these m values. In general, it appears to be much harder to prove lower bounds on time and space 
for decision branching programs than for multi-output branching programs. The second distinction has 
to do with large domains versus small (boolean) domains. In the large-domain case, each input variable 
takes values from a set whose size grows with the length of the input (the number of variables). In the 
small-domain case, the input variables take values in a fixed set of constant size — the set {0,1} for 
boolean domains. Again, it appears to be more difficult to prove time- space lower bounds for boolean 
(2-way) branching programs than for branching programs over large domains. Finally, one distinguishes 
between general, unrestricted, branching programs and branching programs that are restricted in some 
important way. Common restrictions include read-k (each input variable is accessed at most k times) and 
oblivious (input variables are read in the same order along all paths) branching programs. Obviously, it 
is much harder to establish lower bounds for the more powerful, unrestricted, model than for branching 
programs that are read-fc, oblivious, or otherwise restricted. 

In this paper we seek to answer one of the fundamental questions in theoretical computer science: 

"Can the verification of a particular computation be as complex as performing the computation itself?" 

We will also be concerned with the most difficult case — unrestricted boolean decision branching 
programs — where the state of knowledge concerning lower bounds on time and space for concrete 
functions was rather pathetic until recently. In fact, no such bounds at all were known until the 
groundbreaking work of Beame, Jayram, and Saks [BST98,BJS01]. In [BST98] and [BJS01], the authors 
extended the techniques of Borodin, Razborov, and Smolensky [BRS83] to prove the first (barely) nontrivial 
bound of this kind. They exhibited a problem in P, based upon quadratic forms over a finite field 
(cf. [BRS83]), for which any sub-exponential size branching program requires time at least (1 + e)n, 
where n is the input length and e > is a constant. In a remarkable breakthrough, Ajtai [Ajt99, 
Ajt98] constructed a polynomial-time computable Boolean function (also based on quadratic forms) for 
which any sub-exponential size branching program requires super-linear time. In another breakthrough 
paper, Beame, Saks, Sun, and Vee [BSS03] improved upon Ajtai's results by establishing (for the 
quadratic-form and element-distinctness boolean functions), a time-space tradeoff of the form T = 
n(n-\/log(n/S)/ loglog(n/S) ), which furthermore extends to randomized branching programs with 
(two-sided) error. In the last couple of years, there was more work along these lines (see [SW03,Juk02] 
and other recent papers). Nevertheless, the number of concrete decision problems for which super-linear 
time-space tradeoffs are known in the unrestricted boolean branching program model remains preciously 
small. Moreover it should be noted that none of these tradeoff results are valid when T = O(nlogn) 
- therefore all the above mentioned results are for time-restricted branching programs. In fact, proving 
quadratic time-space tradeoffs for unrestricted time decision branching programs has been a major goal 
of recent research, as mentioned for example by Beame, Saks, Sun and Vee in [BSS03] - a similar goal 
having been long since achieved for multiple-output branching programs [BC82, Yes84, Abr91]. 

Herein we prove a quadratic expected time-space tradeoff of the form TS = and valid for 

unrestricted (including time) q-way deterministic decision branching programs, where q ^ 2. Here T is 
the expected computation time and S is the expected space. This bound is to our knowledge, the first 
such to show an exponential size requirement which holds for T = 0(n 2 ). As mentioned before, previous 
exponential size tradeoffs for Boolean decision branching programs were valid when T = o(nlog 2 w). 
The branching programs we consider are related to families of good linear codes. The tradeoff results 
shown here for decision functions are order-comparable (for ^-way programs where q is a constant) to 
the tradeoff bounds obtained by Santhi and Vardy [SV06] for a closely related multiple-output function. 

We also remark that using the constructive family of Justesen codes which are asymptotically good, one 
may also demonstrate a constructive Boolean decision function which has a quadratic expected time-space 
tradeoff in the unrestricted deterministic Boolean decision branching program model. 

Our results also imply the first ever quadratic time-space tradeoffs for unrestricted deterministic Boolean 
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Table I Various Bounds for deterministic branching programs are compared. C is an (n,k,d)q code, E<j is 
its encoding function, /q is the dual syndrome-vector function and is the partial verifier for 
the dual syndrome-vector (code-coset membership). These are the tightest among all such known 
distance bounds. 



Type of branching program 


Tradeoff 


Computed function 


q-way, multi-output, unrestricted deterministic 


TS = Q(« 2 log 2 q) 


n-length CONV, n-length 
MVMUL and n-point DFT 
(see [Abr91]) 


q-way, multi-output, only restriction: T = o(n log 2 n), 
worst case complexity, any C 




E c (see [BM05, San06]) 


q-way, multi-output, unrestricted deterministic, linear C 




f& (see [SV06]) 


q-way, decision, unrestricted deterministic, linear C 


d = o(if) 


fc,y ( here ) 



Table II New bounds for decision branching programs (partially) verifying some fundamental functions as 
derived in this article. Previous corresponding non-linear (worst-case) time-space tradeoff results 
known for £conv,7 an d £mvmul,7 were for time-restricted branching programs with worst case 
time, T = o(n\og 2 n )- No non-trivial corresponding result is known for £dft,7- The tabulated 
results are derived using the distance bounds from the final row of Table I. 



Type of branching program 


Tradeoff 


Computed function 


q-way, decision, unrestricted deterministic 


ts = n(f) 


©(w)-length q-ary £mvmul, 7 


q-way, decision, unrestricted deterministic 


Ts = n(f) 


®(n) -length q-ary £conv, 7 


Boolean, decision, unrestricted deterministic 


TS = f!(H 2 logn) 


©(n) -point Cdft,j 



decision branching programs which partial-verify circular convolution, matrix-vector multiplication and 
discrete Fourier transform. A quadratic tradeoff is the largest possible for all these problems, because 
trivial programs can be constructed otherwise. The tradeoff results obtained here for decision functions 
verifying fundamental operations are order-comparable (for ^-way programs where q is a constant) to 
the tradeoff bounds derived by Abrahamson [Abr91] for calculating the corresponding multiple-output 
functions. 

In deriving these bounds we use several bounding techniques and introduce a few new ones. These 
include a particular measure of progress which is specific to the decision function considered, partitioning 
the computational paths into disjoint sets and obtaining tradeoffs for each class separately, the concept of 
partial-verification and extensive use of linear constraints to obtain probability bounds. 

II. Mathematical Background and Definitions 

A. Branching Programs 

The branching program (BP) has emerged as the standard general model for nonuniform sequential 
computation. This model is of fundamental importance (perhaps, second only to the Turing machine); it 
was introduced some 50 years ago by Lee [Lee59] and has been extended, refined and extensively studied 
in a number of papers since then [Kuz76,BC82, Weg87, Abr91, Ajt98, Ajt99,BJS01,BSS03,BST98,BC82, 
BRS83,Weg00]. 

The branching program model imposes no structure on the computation and allows any pattern of 
access to the input. Nevertheless, this model is strong enough to efficiently simulate many other models 
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of sequential computation, such as RAMs with arbitrary instruction sets [BC82]. RAMs of space S and 
time T with an arbitrary instruction set can be simulated by branching programs of size 2°( s ) and time 
T. This underscores the importance of establishing lower bounds on time and space (and the tradeoff 
between them) in the branching program model. A tremendous amount of research has been devoted to 
this problem in the past two decades [Abr91, Ajt99, Ajt98, BM05, BJS01, BSS03, BST98, BV02, BW01, 
BC82, BRS83, Ger94, Juk95, Juk02, MNT93, Mor73, Oko91, Oko02, PagOl, Pip78, Pon98, Raz91, Win67, 
Yes84], and considerable progress has been made in proving lower bounds for less and less restricted 
branching-program models. 

Loosely speaking, a branching program B is a finite directed acyclic graph, with a unique source 
node and one or more sink nodes. Each non-sink node is labeled by a variable and the edges out of 
the node correspond to the possible values of that variable. Executing the program on a given input 
corresponds to following a path from the source node to a sink node, using the values of the input variables 
to determine the edges to follow. We will give a precise definition of branching programs in this section. 

B. Decision Branching programs 

When dealing with deterministic single-output branching programs, the following is the model we will 
use throughout this paper. The definition below follows Beame, Saks, Sun, and Vee [BSS03]. 

Definition II.l Let V be a finite set of size q, and let X be a finite subset of Z of size n. A q-way 
deterministic decision branching program B on domain V with index set X is a graph Q with the following 
properties: 

a. Q is a finite edge-labeled and vertex-labeled directed acyclic graph, with a unique source node; 

b. There are two sink nodes in Q, one is labeled by and the other by 1; 

C. Each non-sink node vofQ is labeled by an index i(v) G X; 

d. The sink nodes have out-degree zero, all other nodes have out-degree q. For each non-sink node v, the 
set of edges starting at v is labeled by the elements of V so that all the q edges are labeled distinctly. 

For simplicity, we henceforth assume without loss of generality that the index set X in Definition II. 1 
is given by X = [n] and write V 1 as V n . 

A computation by a decision branching program B on an input x G V n starts at the source node s, 
reading the value of the variable and following the edge labeled by that value. The process continues 
until one reaches a sink node. We say that B accepts an input x if the sink node is labeled with a "1". 
Otherwise, we say that B rejects the input x. We let B~ l (l) denote the set of all inputs that B accepts. 
Thus B computes the function /: V n -> {0,1} defined by f(x) = 1 iff jc G 

C. Some Branching Program Terminology 

The total number of nodes in B is called its size and denoted by \B\. The space of B is then defined 
by S = log \B\. The length of a computation in B is the number of edges in the corresponding path. The 
time T of B (sometimes also called the length of B) is defined as the maximum length of a computation 
in B. Note that not all paths from the source to the sink in Q are (valid) computations. We define the 
depth of B as the number of edges on the longest path from the source to the sink in Q. 

A branching program B is said to be leveled if the set of nodes of Q can be partitioned into an ordered 
collection of subsets Vq, V\, . . . , Vg, called levels, such that the edges of Q are always directed from a 
node at level V/_j to a node at level V,, for some z'G {1,2, ...,£}. Pippenger [Pip78] proved that any 
branching program can be made leveled without affecting its time T while adding only C(logT) to its 
space S. The width of such a leveled branching program is defined as max{|Vb|, \ Vi\, . . . , \ Vi\}. 
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A branching program B is said to be oblivious if the input variables are read in the same order along 
all possible paths from the source to the sink in Q. A read-r branching program is one in which each 
input variable is read at most r times along any path from the source to a sink in Q. A ^-way branching 
program with q = 2 is said to be boolean. A useful visualization of a boolean branching program is 
as a RAM with 1-bit-wide input registers and a working memory of S bits [Ajt99, BC82]. As already 
mentioned in the introduction, proving lower bounds on the time-space tradeoff is generally considered 
the most challenging for boolean branching programs. Indeed, as shown in [PagOl, Proposition 1], a factor 
of log 2 q is lost when converting any space or time lower bound from a ^-way branching program to a 
boolean branching program. 

The expected-time T of B is the mean time of a computation when the input jc is chosen uniformly 
at random from V n . The nodes of Q can be labeled with the integers 0,1,. . ., \B\— 1 in |i3|! different 
ways. Fix one such labeling. Then, for each input jc G V n , the workspace required by B on input jc is 
defined as the logarithm of the largest integer which occurs as a label of a node in the computation path. 
The expected-workspace is the mean workspace obtained when jc is uniformly random over V n . The 
expected-space S of B can be now defined as the minimum of the expected-workspace, taken over all 
labelling s. 

The candidate functions for deriving time-space tradeoffs for branching programs will be from coding 
theory - these functions will be defined in Section II-D. 

D. Error-Correcting Codes 

Let Fq denote the finite field of order q. An error-correcting code C of length n over is simply a 
subset of F^ . A linear code of dimension k is a fc-dimensional subspace of F^. If C is a linear code, 
then |C| = q k for an integer k. We shall assume that |C| = q k throughout, whether C is linear or not. An 
encoder for C is a one-to-one and onto (bijective) function Eq : F k — > C. The Hamming distance between 
two vectors in F^ is simply the number of positions where they differ. The minimum distance of a code 
C is the minimum Hamming distance between any two distinct vectors in C. When we say that C is an 
(n,k,d)ci code, we mean that C C F^ is such that |C| = q k and the minimum distance of C is at least 
d. If the order of the underlying field is either clear from the context or is not relevant, we will write 
(n,k,d) instead of (n,k,d)q. 

The rate of an (n,k,d) code is k/n and its relative distance is d/n. A set of codes characterized by a 
certain property is called a family of codes (e.g., the family of linear codes). A family of codes T is said 
to be asymptotically good if it contains an infinite sequence of codes 1 C (?/ 2 C^, . . . of increasing length 
n, such that both the rate and the relative distance of all the codes in this sequence are bounded away 
from zero as n — > oo. 

Let C be an [n,k,d) linear code over Fq. A generator matrix for C is any k x n matrix whose rows 
form a basis for C over F^. The dual code is the set of all x G F™ such that (x,c) = for all c EC, 
where (•, •) is the usual inner product over F^. Clearly, C is an (n,n—k,d^) linear code. A generator 
matrix for is said to be a parity-check matrix for C. 

An (n,k,d) code C is said to be maximum distance separable (MDS) if its minimum distance satisfies 
d = n — k + 1. The well-known Singleton bound implies that this minimum distance is the largest possible 
for an (n,k,d) code. For a comprehensive overview of MDS codes and their properties, see [MS77, 
Chapter 11]. 

Although we will be interested in decision functions related to error-correcting codes in this paper, it 
is useful to consider a few multiple-output functions related to coding. One such function is the encoder 
function Eq defined above. Another multiple-output function of importance is the co set-identifier function, 
which gives the coset (with respect to the code, C) to which an input jc G F^ belongs. For a linear code 
this is equivalent to the syndrome-vector function: 
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Definition IL2 If C is an {n, k, d) linear code over Fq, with generator matrix G and parity-check matrix H, 
we define the syndrome function f<j : F^ — > F } ^ k and dual syndrome function /q : F^ — > F^ as follows: 
f c (x) = Hx l and f£(x) = Gx l . 

Time-space tradeoff results are known [BM05, SV06, San06] for the functions Eq and /q defined above 
on the multiple-output branching program model. These results have been tabulated in Table I. In the next 
section we define and carefully study a closely related decision function. 

III. A Decision Problem from Coding Theory 

We will be interested in decision functions related to error-correcting codes in this paper. The 
characteristic function (also known as the membership function), defined below is a natural choice for a 
decision function: 

Definition III.l Let C he code of length n over Fq. Its characteristic function is the function xc '■ Fq — ► 
{0, 1 } defined by xc (x) = 1 if and only ifx G C. 

However in our study of decision branching programs we found that another function related to the 
membership function of is more convenient: 

Definition IIL2 Let C be an [n, k, d) linear code over Fq. Let G be a generator matrix of this code. Let 7 be 
a real constant such that < 7 < 1. For an input (x,y) e F^" x the decision function fc,j : Fq x Fq — > 
{0, 1} evaluates true, namely fc,j(x,y) = 1 if and only if at least a fraction 7 of the k equations Gx l = y t 
are satisfied. 

For decision problems with large time-space tradeoffs, we observe the following general traits. First, 
the problem should have a nice algebraic structure for establishing the tradeoff. Second, the fraction of the 
acceptance set should preferably not decrease exponentially with the problem size. This second condition 
is necessary to obtain non-trivial tradeoffs. Both of these conditions can be conflicting and difficult to 
simultaneously ensure. 

The decision function /c,7 defined as above appears in some form in most decoding problems 
encountered in coding theory. We have, 

Lemma III.l Let (x,y) e x Fq be an arbitrary tuple of vectors. If J\f C [k] then we denote by G^ the 
sub-matrix of G consisting of only those rows which are indexed by J\f. Then 

(x,y) e/c^(l) ^ 3 jV C [k] such that \AT\ ^ \yk] and Gux 1 - I N y l = 0* 

and, 

(*,jOg/ c >) ^3A/-C [k]and3ze{F q \{0}}W 

such that \J\f \ > - 7)k\ and G Af x t - I^y 1 = z l 
where consists of the rows of the identity matrix I indexed by M. 

Proof. Follows directly from the definition of /c/y- ■ 
Lemma IIL2 The acceptance ratio ol d = ^ c ; 7 + ^^ is given by ol = v ^ k '^ l k ^) k i) ^ w here V q (N,r) is the 

q q 

volume of an N -dimensional q-ary Hamming ball B q (N, r) of radius r. The asymptotic acceptance ratio is 
given by 
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H q (l- 7 )-l, 



if I < 7 < 1, 



..... r lo S, I 1 "*) 



H,(l-7)-l f 



else, 



where Hq(-) denotes the q-ary entropy function. 

Proof. Consider an input (x,y) eF^x F*. By Lemma III. 1, it is accepted iff 3 an index set TV C [k] such 

that |jV| ^ \yk~] and G N x l — I^y 1 = 0*. This happens when dniyK G^x*) ^ [(1 ~~ 7)^J» wn e re ^h(v) 
denotes the Hamming distance between two ^-ary vectors. Since there are a total of ^ M such jc we have 
1/^(1)1 = q n V q (k, [(1 - 7)fcJ) and the result follows. 

It can be shown that (f)(q - If < (,■*]) - as long as i < - J; beyond this, the 

inequality gets reversed. 



(f ) (f ~ ^ ( X - ^ ( fc ~ z 'o + !) (f ) (9 - i)' ' where z 'o = K 1 - l) k \ + !• Takin g logarithms 
and limit the result follows. ■ 

It is easy to see that depending on 7, either a or (1 — a) can decrease exponentially with k. For larger 
alphabets most often oc is exponentially small. The decision function fc~ has the interesting property 
that both the acceptance and rejection sets have a nice linear algebraic structure, which we intend to take 
advantage of. 

In prior work, several quadratic time-space bounds valid for T = 0(n 2 ) for multiple-output branching 
programs where successfully derived. In all those instances, the principal means for obtaining these bounds 
was furnished by a convenient measure of the number of correct outputs generated by the program on any 
partial computation path. In single output branching programs, there is only one output variable precluding 
use of this particular strategy. For example, in the papers of Beame et al [BSS03] and Ajtai [Ajt99, Ajt98] 
the properties of a core set of input variables read exclusively by a partial computation path is used to 
obtain the time-space tradeoffs. However this technique has so far not resulted in a quadratic time-space 
tradeoff for Boolean decision programs which is valid for any T. In this paper, we use a new measure of 
progress which is made possible by a careful choice of the decision function. Our measure of progress 
in computing the decision function is the number of variables corresponding to the vector y that B reads 
during the computation and which in turn must satisfy certain linear constraints inherent in the problem. 



Let B be a q-ary deterministic decision branching program which computes fc,y Let us assign a 
labeling to the nodes of B which minimizes the expected-space of B. 

Definition IV. 1 A branching sub-program & of a branching program B is a collection of partial 
computational paths in B. Let P be an arbitrary partial computational path in the collection . P consists of 
a partial computational path which starts from a node in B and ends in another node in B. Each such member 
of can have a different start node (and/or sink node). The expected time of a branching sub-program is 
the average length of the partial computations in the sub-program. The expected space of & is the average 
over all partial computations P in £P of the maximum node labeling on any node in P. The node labeling is 
assumed to be assigned to the nodes in B so as to minimize the expected space of B. 




IV. A Quadratic Time-Space Tradeoff for fc, 



Now let B\ be the branching sub-program consisting of all computation paths starting from the root node 
of B, and ending in the accepting sink node. Similarly, let Bq be the branching sub-program consisting 



s 



of all computation paths from the root node of B and ending in the rejecting sink node. The following 
relations are obvious: 

Lemma IV.l Let the expected-time and expected-space of B beT and S. Similarly, let the expected-time 
and expected-space of the computational paths in Bo and B\ be denoted as To, So and T\, S\ respectively. 
Then, 

T = «Ti + (l-«)T (1) 
S = aSx + (1 - a)S (2) 

where oc d = ^„ + ^ is the acceptance ratio. 

Proof. Let us denote the computation time on input (x,y) by f( x ,y)- Then 

T =E^#= E E ^ = *T 1 + (i- a )To 

(x,y) (x^EB-Hl) (x,y)eB-H0) 

since and i3 _1 (0) disjointly partition x F^. Expected space is treated in a similar manner. ■ 

We now proceed to obtain a time-space tradeoff valid for B. Towards this, we will first obtain time- 
space tradeoffs for B\ and then Bo separately, before using Lemma IV.l to obtain the desired result. We 
have, 

Lemma IV.2 Let B be a q-way branching program of depth 5. Forr = 1,2, ... ,5, let rj r denote the number 
of computation paths in B which read exactly r different input variables. Then 

E M~ r = 1 (3) 

r=l 

Proof Let P denote a specific computation path in B that reads exactly r input variables, and assume 
w.l.o.g. that these variables are . . .,y r - Since P is a computation path, the labels of the edges of 

P must be consistent — that is, if v and v' are different nodes of P that read the same variable, then the 
edges of P starting at v and v' must have the same label. Thus let U\, ui, ■ ■ ■ , u r E V denote the labels 
on the edges of P that correspond 1 to . . . ,y r , respectively. Then, the computation of B upon input 

x e V n follows the path P from the source to the sink iff X\ = U\, = ui, ■ ■ ■ ,x r = u r . Now, suppose 
that x is chosen uniformly at random from V n . Then, by the foregoing discussion, the probability that the 
computation of B on input x follows the path P is q~ r . Let denote the set of all computation paths in 
B. Then 

s 

1 = Pr{ computation on x follows P} = VrH r (4) 

since for every x e V n , the computation of B upon input jc follows exactly one path in & . Thus we are 
summing over probabilities of disjoint events that partition the sample space. ■ 

Lemma IV.2 leads to the following elementary result: 

Lemma IV.3 Let B be a q-way branching program of depth 5. Let 2? be a branching sub-program from B. 
For r = 1,2, ... ,5, let t] r denote the number of computation paths in & which read exactly r different input 
variables. Then 

5 

E vr r < i (5) 

r=l 



1 Henceforth, we shall say that U\, U%, ■ ■ ■ , U r are the values which the path P enforces on the variables 1/1,1/2, ■ 
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Proof. Follows immediately from Lemma IV.2. Not all terms in the summation of probabilities in (4) are 
present. Each term is non-negative, implying the inequality. ■ 

We will make use of the following property: 

Lemma IV.4 Let G be ak x n generator matrix for an (n, k, d) linear code C. Let a, b be positive integers 
with a ^ d — 1 and b ^ k. Then every b x (n—a) sub-matrix G' of G is full-rank. 

Proof. Since d — 1 ^ n — k for any (n, k, d) code by the Singleton bound, we observe that b ^ n — a. 
Thus what we need to show is that rankG' = b. To this end, let G" be the k x (n—a) column sub-matrix 
of G such that G' is a row sub-matrix of G" . It is well known that for a ^ d — 1, every n — a columns 
of G contain an information set. Hence rank G" = k and its k rows are linearly independent. It follows 
that the b rows of G' are also independent, and so rank G' = b. ■ 

The following definition which fixes the notion of partial decision will also be used extensively: 

Definition IV.2 Let B be a branching program, and let & be a branching sub-program from B. Let c ^ k 
be a positive integer. Consider a specific computation path P in and a specific vector (x,y) gF ? " x F™ We 
say thatP (c)-decides (x,y) with respect to fc,j iff 

(i) The set of all nodes and edges in the path P is a subset of the set of nodes and edges in the computational 
path Pi v in B corresponding to (x,y) , so that the labels of all the edges in P are consistent with (x,y). 

(ii) The path P reads at least c of the input variables corresponding to the positions iny. 
(Hi) There exists an index setftf = J\f( x ,y) Q [k] with \N( x ,y)\ ^ c sucn that, 

(a) If(x,y) e/c,* (1); then G N x l - I M y l = l 

(b) If(x,y) e fc} 7 (0); then G M x l - I M y l = z f where ze {F,\{0}}M. 

We say that £P (c)-decides (x,y) with respect tofc,j iff the computation path that 3? follows upon input 
(x,y) (if such a path exists) (c) -decides x w.r.t. fc,<y- 

Let us consider the branching sub-programs B\ and £>o obtained from B. By the standard technique, 
level both B\ and Bq separately. To make Bj, j <E {0,1} leveled, first add q edges labeled by all the 
elements of V = Fq from the sink node of Bj to itself. Then, replicate the resulting graph 5 + 1 times, 
where 8 is the depth of Bj. Such replication produces the 5 + 1 levels Vq,V\,. . ., Vg, with \Vi\ = \Bj\ 
for all i. Now, for i = 0,1,. . . ,5—1, redirect all edges from nodes at level V, to nodes at level Vi + \. 
Finally, delete all nodes that are unreachable from the source node at level Vq (deleting a node means 
also deleting all the edges that are incident upon this node). This produces a leveled set of computation 
paths Bj with 5 + 1 levels, whose source cp is the source node at level Vq (in fact Vo = {(p}) and whose 
sink (p is the sink node at level V$ (in fact Vg — {<p}). 

Next, we truncate each Bj, j £ {0,1} to a depth of [(1 + t)7Y|, where T > is an absolute constant 
to be fixed later. Here, truncation to depth T means deleting all the nodes that are unreachable from the 
sink node at level Vj when the direction of all edges is reversed. This produces a leveled branching sub- 
program with T + 1 levels, which we denote by B'j. Observe that Bj and B'j compute the same function, 
while the time and space of B'j are given by T- = Tj and S'- ^ Sj + log Ty. 

We call these two branching sub-programs B'j, /£ {0,1}. Let (x,y) £ B^ 1 ^ for /£ {0,1} and let 
t( X/ y) denote the computation time on input (x,y) of B. Then by a simple application of Markov inequality, 

f* v) ^ T(l + t)T/1 } ^ 1 ^ — — 

1 [x,y) ^ IV ) j\S ' (1+T)T 1 + T 
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Hence the branching sub-programs B'-, j E {0,1} successfully complete calculating the decision function 

with probability at least ^7 when (x,y) is chosen uniformly at random from /c~(/'), /g{0, 1}. The 

expected- space Sj required for B'- cannot exceed Sj + log 2 [(1 + t)Tj] for / E {0, 1}. The expected-time 

Tj of B'j increases due to the leveling process and decreases during truncation. We will not need an 

estimate of Tj in this paper. The following result provides us with a crucial measure of progress of 
computation. 

Lemma IV.5 Let BL j E {0, 1} be segmented into B blocks. Then 

(i) Let (x,y) E B'^ 1 (l). There exists some block &( x ,y) and a branching sub-program fully contained in 
that block which ( [ -^p] ) -decides (x,y) w.r.t. /c/y- 

(ii) Let (x,y) E <6q _1 (0). There exists some block &( x ,y) an d a branching sub-program fully contained in 
that block which ( \ {[{1 ^ ki+1) ]) -decides (x,y) w.r.t. f CfT 

Proof. First let us observe that due to the truncation process, there are some inputs which do not reach 
a sink node in either of the two branching sub-programs B[ or B' Q . Therefore in general B'^ (1) U B'^iO) 
is not equal to x . 

(i) By assumption, (x,y) E Therefore, the computation path P which B[ follows on (x,y) ends 
in the accepting sink node. Now assume to the contrary that there are no blocks which ([J^p]) -decide 
(x,y) w.r.t. fc,y- By hypothesis, the condition (z) is satisfied in Definition IV. 2. Therefore conditions (ii) 
and/or (in) (a) must be the ones which are not satisfied. That is, either there is no block in B[ which 

reads more than [p?pl of the variables corresponding to the input vector y, and/or the read variables do 
not satisfy the linear constraints of (in) (a) in Definition IV. 2. Therefore the computational path P that 
B follows on input (x,y) does not read more than \yJc\ — 1 of the variables corresponding to y which 
simultaneously also satisfy the linear constraints in Lemma III. 1. This means that there are always some 
inputs which are erroneously accepted by B due to this computational path P. These are inputs of the 
form (x,y), such that (x,y) and (x,y) coincide on the variables actually read by P and yet gj ■ x = yi 
is satisfied for fewer than pyfc] rows of G. This is not possible for a deterministic decision branching 
program B which should be correct on all possible inputs. This proves the first claim. 

(ii) This part of the proof uses a similar technique as the first part. Once again if we assume to the 
contrary, condition (ii) and/or (iii)(b) of Definition IV. 2 must be the ones which are not satisfied, and 
this cannot be the case for a deterministic program. ■ 

We have the following result: 

Lemma IV.6 Let G be the generator matrix of an (n,k,d) linear code C over Fn. Let < 7 < 1 be an 
absolute real constant and let c ^ k be a positive integer. Let the acceptance ratio of the decision function 
fc,y be denoted as oc. Let & be a branching sub-program from a branching program B of depth 5 < d + c. 

(i) If (x,y) is chosen uniformly at random from * (1), then 

(-Y 

Pr{^ (c) -decides (x,y) w.r.t. / C/7 } ^ \- 

(ii) If (x,y) is chosen uniformly at random from *(0), then 

(i--y 

Vr{@> (c) -decides (x,y) w.r.t. f C/7 } ^ 

Proof. In order not to repeat parts of the proof, we initially consider any j E {0,1} and then specialize 
for the two particular cases when necessary. Let S be the event that the branching program B (c) -decides 
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(x,y) w.r.t. fc,j, ] £ {0,1}- For each computation path P in B, let £pbe the event that P (c) -decides 
(x,y) w.r.t. fey- Then clearly 

Pr{£}= E Pr (^} ( 6 ) 

Now let P be a specific (fixed) computation path in B and suppose that 

(a) P (c) -decides (x,y) w.r.t. /b/y, and 

(b) P reads exactly r input variables. 

The assumption that P (c) -decides (x,y) in particular implies that P is contained in the computation path 
that B will follow upon input (x,y). Let u and v be the vector of values that P enforces on the input 
variables corresponding to jc and y that it reads respectively. Then (x,y) must satisfy Xj = Uj if Xj is read 
by P and \/j = Vj if \)j is read by P. All events are considered subject to the condition that (x,y) G f@ *(/). 
Therefore by Definition IV. 2 and Lemma III. 1 we have the following: 

(i) j = 1: P induces a set of & ^ c linear equations of the form Ylf=iSi,l x l ~ Vi — which (jc,j) must 
satisfy. This also implies that r ^b. Together the induced system of linear equations is G*x t — I^y 1 = t . 
Here G* consists of b distinct rows of the generator matrix G with a possible permutation of the columns 
to match the variable ordering in jc and It consists of the corresponding b rows of the k x k identity 
matrix I. corresponds to the vector of correct decisions made by P. Combining all of the above, we 
find that (x,y) must satisfy the following system of r + b linear equations: 



J* 


@bxn 


T* 
l b 


G* 


— 1* 
l b 



X' 



-b)x(n+k) 



. J (n+k)xl 



u 



f 



(7) 



-b)xl 



where I*_ b 



are some (r—b) rows of the (n+k) x (n+k) identity matrix corresponding to the input 
variables read by P except the b input variables in_y which belong to the set of satisfied linear constraints. 
Those input variables are counted as part of It. Let M be the (r+b) x (n+k) matrix in (7). Then 
rankM = rank J* ,, + rank/^ +rankG / , where G' is the b x (n—r+b) matrix consisting of some 
(n—r+b) columns of G* which are zero columns in /*_,. Since r ^ 8 < d + c, we conclude that 
rankG' = b by Lemma IV. 4 and rankM = r + b. It follows that the linear system (7) has exactly 
qn+k-b-r distinct solutions in F" +k , and so 



Pr{£p} 
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n+k~b—r 



i/a 7 (i: 



DC 



(8) 



Observe that the bound on Pr{£p, (x,y) e/ c ^(l)} in (8) depends only on the number r of the input 
variables that P reads. If there are n r computational paths in & which read exactly r variables then, 



Pr{£} ^ £n r q 

r=l 



rH 



a. ol 



where the first inequality follows from (6) and (8), while the second one follows from Lemma IV.3. 



(ii) j = 0: This case is similar. P induces a set of b ^ c linear equations of the form X^" =1 gi iX\ — \j\ = Z\ 
which (x,y) must satisfy, where z 2 - e {F^\{0}}. This again implies that r ^ b. Together the induced 
system of linear equations is G*x t — l^y 1 = z i ■ Here G* consists of b distinct rows of the generator 
matrix G with a possible permutation of the columns to match the variable ordering in jc and 1^ consists 
of the corresponding b rows of the k x k identity matrix I. z corresponds to the c linear constraints satisfied 
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by P w.r.t. fc,j and can be any vector from {F,j\{0}} fc . Therefore (x,y) must satisfy the following system 
of r + b linear equations: 



J* 

L r-b 


@bxn 


J* 
L b 


G* 


l b 



X' 



J (r+b)x(n+k) 



Lr J (n+k)xl 



u 



. J (r+b)xl 



(9) 



where I* . are some 



r— fc) rows of the (n+fc) x (n+fc) identity matrix corresponding to the input 
variables read by P except the b input variables in y which belong to the set of satisfied linear constraints. 
Those input variables are counted as part of If. Let M be the (r+b) x (n+k) matrix in (7). Then 
rankM = rankl*_t + rank 7^ +rankG / , where G' is the b x (n—r+b) matrix consisting of some 
(n—r+b) columns of G* which are zero columns in /*_,. Since r $C 5 < d + c, we conclude that 
rankG' = b by Lemma IV. 4 and rankM = r + b. It follows that the linear system (9) has has at most 
l) b ■ q^+k-b-r distinct solutions in F" and so 



Pr{£ P } ^ 



(q-1 



\ b -q n - 



i/cA(°) 



■k—b—r q 



1 



(10) 



The bound on Pr{£p, (x,y) G/ c ^(0)} in (10) depends only on the number r of the input variables that 
P reads. If there are rj r computational paths in which read exactly r variables then, 



Pr{£} ^ Y^t] r q~ 

r=l 



,(1 



i)c 



(1 



ly 



1 — 0L 1 — 0. 

where the first inequality follows from (6) and (10), while the second follows from Lemma IV.3. ■ 

Theorem IV.l Let C be an (n,k,d) linear code over F„. Let B be a q-way branching program which 
computes the decision function /c, 7 in expected-time T and expected-space S. Then B satisfies the time- 
space tradeoff given by: 



where a 



6T (S + log 2 T + 7) ^ kd ■ maxja: 2 7log 2 q, (1 

de f \B~Hl) 



! (l-7)lo g2 ^r 



(ID 



7 H+/C 



is the acceptance ratio for B. 



Proof. We obtain tradeoffs for Bj, j E {0, 1} separately and then use Lemma IV.l. So consider Bi. Earlier 
we used a leveling and truncation procedure to obtain B'-. We also showed that with probability at least 
j^pp, this leveled and truncated collection of computation paths (branching sub-programs) successfully 
completes calculating the decision function fcy(x,y) for an (x,y) chosen uniformly at random from 

Let us now segment each BL j £ {0, 1} separately into Bj equal sized blocks of depth Sj (except perhaps 
the last block, which could be shorter). For Lemma IV.6 to be applicable, we should ensure that Sj < 
d + Cj. Here Cj are given by an application of Lemma IV.5 thus: 

(i) j = 1: For every such (x,y) for which the acceptance decision is successfully made by B[, there 



must be some block which correctly decided in the affirmative a set of questions of the form 



grx- yi 







(12) 



def 



for all i G N(x,yy where N( x ,y) — W ^ s an index set such that C\ \N(x,y)\ ^ l^B^I ^ ^ 
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(ii) j — 0: Similarly, for every such (x,y) for which the rejection decision is successfully made by BL 
there must be some block which correctly decided in the affirmative a set of questions of the form 



gi-x- ]fi 7^ 



(13) 



def 



for all i G Mi Xi y\, where N( Xl y) ^ W\ i s an 

index set such that c = \M^ y) \ ^ [ (L(1 y J+1) ] ^ 1. 



We see that choosing Sj = d satisfies the requirements and so By 



Let b be any of the blocks. Let v be an arbitrary node on the starting boundary level l\, of this block. 
We will use the following notation: 

def 

Pj /V (x,y) = Pr{^V (cy)-decides (x,y) w.r.t. f c , 7 } 

where & v is the branching sub-program consisting of a collection of partial computation paths starting 
at node v up to but not including nodes in the next boundary level 

The probability of successful computation of the decision function fc^ is upper bounded by the 
probability of the existence of a block with such an index set N(x,yy For an (x,y) chosen uniformly 
at random from this later probability is at most 

E E PjA x >y) = E E PjA x >y) + Yl E VjA x >y) 

b vee b : b vee b -. 



b v£4 



l(v)<2 aS i 



where l(v) is the optimal labeling earlier assigned to the node v and cr > 1 is an absolute constant to be 
fixed later. By Lemma IV.6, we have the first term upper bounded by either 



(ii) j = 0: 



2 aS i 
a 

2^0 
(1=5) 



or, 



(l_l)l B I 



To upper bound the second term, we invoke Markov inequality. If W( x ,y) denotes the workspace required 
for computation on (x,y), then W( X/ y) = log 2 [max v g p . Z(v)] where P( x ,y) is me unique computation 
path corresponding to (jc,v). So 



Pr(Z(v) ^2 £rS /,vGP ( , 



x,y), 



Prfa; 



(*,y) 



^ crS 



1 



We also know that Sy ^ Sy + log 2 [(1 + T)Ty] . Combining all of the above we get the following two 
inequalities, 

(i) j = 1: 



lo g 2 



T 1 

1 + T £7" 



+ 



f7*l 



log 2 ^ C r(S 1 +log 2 r(l + T)r 1 l) 



(14) 



(ii) = 0: 



lo g 2 



_r I 

1+T <T 



+ 



L(i-?)/cJ+i 



r(i+T)r i 



lo §2 (^T 



^cT(s + io g2 r(i+T)T i; 



(15) 
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From Lemma IV. 1 we get, 



and 
and 



Sn ^ i — - 



1-a 



Now we use the above bounds in (14) and (15) to obtain: 



lo g 2 



and, 



log 2 (1 



+ 



\ 7 k] 



\(1+t)T/oc] 
d 



logger- | + log 2 



(l±l)I 



T 
1 + T 



+ 



[(1-7)*] +1 



r(i+T)r/(i- a )i 



log 



_2_ 

2 U-l 



(16) 



(17) 



Each row of G is of weight at least d, thus any question of the form (12) or (13) can be answered only 
if f(x,y) ^ d+1. This means T ^ d+1. After some algebra, the tradeoffs implied by (16) and (17) for the 
branching program B are found to be as in (*) in Table III on page 14. 



Table III Intermediate steps during the derivation of (11) from (16) and (1 7) in the proof of Theorem IV. 1. 



■ T ■ (S + « • log 2 [Ml - (f ) • log 2 [« ■ - I)] 
• T • (5 + (1 - «) • lo g2 [^] -(*=*)• log 2 [(1 - «) • - i 



> fcdlog 2 ' -1 



If we let l 



(2 + T) 



— f — ; 1/lr then (t) reduces to: 



<r(2 + T)T(s + K-\og 2 T- (1+ I)alog 2 a + ft;log 2 c) 



fcd log 2 



fcda 2 7log 2 



cr(2 + t)T (S + (1 - *) ■ log 2 T - (1 + i)(l - «) log 2 (1 - «) + (1 - «) log 2 f) > - «) 2 (1 - 7) log 2 (^) 



(t) 



(*) 



The bound of (*) may be optimized with respect to the proof parameters t and cr for a specific 7 
subject to T > and u > 1 + ^ For example we chose T = \/2 and cr = ?. Finally observing that when 
< j6 < 1, -jSlog 2 (j6) ^ ^ we get the claimed tradeoff of (11). ■ 

Corollary IV.l Let C be an (n, k, d) linear code from a family T of asymptotically good codes over Fq. Let 
the decision function /c 7 be computable in the q-way branching program model in expected-time T using 
expected-space S. Then the sequence of such branching programs must satisfy the asymptotic time-space 
tradeoff: 

TS = CL(f) (18) 

Proof. Either a or (1 — a) is at least i. Also 7 is an absolute constant between and 1. The family T is 
asymptotically good which means k = @(n) and d = 0(n). Further, observe that log 2 ^ ^ log 2 (^zf)- 

Expanding as a power-series, log 2 (^) = - log 2 (l - *) = £~ =1 ^^2- Thus f > lo §2 (ipi) > J 
since q ^ 2. So we get (18) and therefore Theorem IV.l implies a quadratic tradeoff when q = ©(1). ■ 

Corollary IV. 1 implies quadratic time-space tradeoffs for Boolean decision branching programs verifying 
certain fundamental algebraic functions. Using algebraic code constructions, it is also possible to 
demonstrate a constructive decision function which has a quadratic time-space tradeoff in the Boolean 
decision branching program model. 
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V. A Constructive Decision Function with Quadratic Time-Space Product 

In this section we present an explicit constructive decision function which has a quadratic time- 
space product in the Boolean decision branching program model. The function is related to the class 
of constructive binary codes called Justesen codes [Jus72]. Justesen codes are instances of concatenated 
codes which are asymptotically good. The following definition is based on the exposition in [MS77]. 

Definition V.l Let M be an [N = 2 m - 1, K, D = N - K + 1] Reed-Solomon code over F 2 »< and let a be a 
primitive element of F^"- Let a = (uq, a\, . . . ,flN-l) be a codeword from M, such that a j E F™ ■ Let b be the 
vector (flO/ fl o; ci\,0La\; a^op-ai) fliv-l/ # N_1 flN-i)- Replace each of the 2N components ofb by the 
corresponding binary m tuple to get a binary vector c of length 2mN. For any N and K, with < K < N, 
the Justesen code ^n,k consists of all such vectors c which are obtained from 1%. 

It is clear from the definition that is a rate R = ^ \ linear binary code of length n = 2m N. 

It is also well known that the minimum distance of is d ^ n(l — 2R)(H 2 " 1 (2) — o(l)) ^0.11n(l — 

-£). See [MS77] for a proof. Invoking Corollary IV. 1 , we have proved the following: 

Theorem V.l Let < 7 < 1 and < R < \ be absolute constants and let the decision function f j? nk ,j 
be computable in the 2-way branching program model in expected-time T using expected-space S. Then the 
sequence of such branching programs must satisfy the asymptotic time-space tradeoff TS = G(n 2 ) . 



VI. Quadratic Time-Space Tradeoffs for Verifying Fundamental Algebraic 

Operations 

In Section IV we considered the problem of computing /c /7 to partially verify the dual syndrome-vector 
of a ^-ary linear code Cq[n,k,d]. We showed that any ^-ary branching program computing /c~ must 
satisfy the time-space tradeoff: 

6T(S + log 2 T + 7) ^ kd-max{<x 2 7 \og 2 q, (1 - a) 2 (l - 7) log 2 (19) 

where ol is the acceptance ratio of fc,y which is related to 7 and k. 

We also showed that for a family of asymptotically good codes, branching programs which compute 
/c /7 must satisfy the asymptotic time-space tradeoff: 

TS = n(f) (20) 

Now we will use the above results to derive lower bounds on the time-space resources required for 
verifying several fundamental operations in the branching program model. Target functions considered 
are: cyclic convolution, matrix-vector multiplication, and the discrete Fourier transform. These are defined 
precisely below: 

Definition VI.l Let (a(X))2 denote the binary vector of length n representing the detached coefficients 
of a polynomial a(X) in the ring F2[X]/(X M — 1). Given a fixed polynomial b(X) in this ring, the 
n-bit convolution function CONV b ( X ) '■ {0, l} n — > {0,1}" is defined by CONV fo ( X ) ((fl(X)) 2 ) = 
(a(X) © b(X))2, where denotes polynomial multiplication over F2 moduloX n — 1. 

Definition VI.2 Let M n/tl denote the set of alln x n binary matrices. Given a fixed binary matrix B £ M n n , 
the n-bit matrix-vector product function MVMULb : {0, 1 }" —> {0, 1 } n is defined by MVMULb (x) = xB, 
where xB denotes the usual (row) vector-matrix product operation over F2. 
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Definition VI.3 Let n = 2 m . For a vector x = (xq, X\, . . . , x n _2) ° ver F«> lot (x)2 denote the binary vector 
of length mn obtained by concatenating the binary representations of Xq, X\, . . . , x n _2 with respect to a fixed 
basis for F n over F 2 . Given a primitive element oc of F n and a fixed fraction p = k/ (n — 1) in (0, 1), the 
k-point DFT function DFT pA : {0,l} m ( M - 1) -> {0, l}^*- 1 ) is 

DFT pA f(x} 2 ) = f ( ( £ x if £ V/ • • • / E ■ ■ • ' E V 

Z=0 Z=0 !=0 !=0 

The following definition fixes the notion of partial verification of a vector valued function: 

Definition VI.4 Let V be a finite domain. Consider a vector valued function f : V n — > V . Let 7 be 
an absolute real constant such that < 7 < 1. For an input (jc,j) G £>" x P fc the decision function £fy : 
V n x V k — > {0, 1 } evaluates true, namely £ f 7 (jc, j) = 1 if and oniy if at ieast a fraction 7 of the equations 
fi(x) = yi are satisfied. Here f(-) denotes the i th coordinate of the evaluated function. 

Our results for unrestricted deterministic decision branching programs partially verifying fundamental 
operations are summarized in Table IF The detailed proofs are given below. We have the following results: 



Theorem VI. 1 Let 7 be an absolute real constant such that < 7 < 1. A deterministic boolean decision 
branching program B computing either £conv,7 or £mvmul,7 in expected time T and expected space S 
satisfies: TS = H(n 2 ) 

Proof. Let C(2n,n,d) be a rate half systematic quasi-cyclic code with generator matrix of the form G = 
[I n I C] , where C is a square n x n circulant matrix. A result due to Chen, Peterson and Weldon [CPW69] 
and Kasami [Kas74] says that the family of rate half quasi-cyclic codes achieve the binary GV bound. 
Moreover they show that this occurs when the polynomial associated with the first row of C is relatively 
prime to (x n — 1). It is also clear that the columns of the parity check matrix can be rearranged to get 
G. Therefore the dual code C 1 - is equivalent to C. 

Let the binary polynomial associated with the circulant matrix C be g(x). The encoding of a binary 
information polynomial i(x) E GF(2)[x]/ [x n — 1) can then be represented in the following form: 

i(x) 1 > c(x) = (i(x) , i{x) &g(x)) 

where represents CONV operation. 

So as to arrive at a contradiction, let us suppose that £conv,7 has a time-space tradeoff given by TS = 
o(n 2 ) in the branching program model. A trivial modification of such a branching program will compute 
/q in TS = o(n 2 ). To see how, consider an input (x,y) E F 2n x F^ and observe that gi ■ x = X\ + Cj • Jc, 
where gi is the i th row of the generator matrix G and c ; is the i th row of the circulant matrix C. Here Jc 
denotes the vector in formed by taking only the last n coordinates of x. The term C{ ■ Jc forms a part 
of the CONV operation. This means f@ can be implemented using £conv,y which operates on (x,y — Jc), 
where Jc denotes the vector in F£ formed by taking only the first n coordinates of jc. Finally, since by 
(20), /q computation satisfies TS = Q(n 2 ) in the branching program model, so should £conv,7- 

To obtain the time-space lower bound for £mvmul,7> consider the problem of computing Gjc, given an 
input vector jc, where G represents the generator matrix of a random linear code. As the family of linear 
codes is asymptotically good, by (20), we get a time-space tradeoff for £mvmul,7- ■ 

Theorem VI.2 Let 7 be an absolute real constant such that < 7 < 1. Let n = 2 m be an integer. Let 
< p < 1 be a fixed fraction and let k = p(n — 1). A deterministic boolean decision branching program 
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B which partially verifies a k-point DFT by computing £dft,7 in expected time T and expected space S 
satisfies: TS = Q(n 2 log 2 ri) ■ 

Proof. As in the proof of Theorem VI.2, consider a length (n — 1), dimension k Reed-Solomon code, 1Z. 
Let the roots of its generating polynomial in be the (n — 1) consecutive powers od of a primitive 
element tx G F2<". Both 1Z and 1Z 1 - are MDS codes and have fixed rates. The syndrome computation for 
1Z is precisely the computation of a /c-point DFT. 

Now take the binary image code of 1Z, which we denote by C. C is a ((n — l)m,km,n — k) linear 
binary code and £dft,7 is precisely /c /7 . The result follows on applying (19). ■ 

VII. Summary 

We derived a minimum distance bound for codes using the deterministic branching program model for 
non-uniform sequential computation of a decision function related to code-coset membership. Specifically, 
we looked at the deterministic branching program complexity of verifying the syndrome vector of linear 
codes. We derived the first ever quadratic time-space bounds for unrestricted deterministic decision branch- 
ing programs. 

The minimum distance bounds so derived were used to obtain the first ever time-space tradeoffs for 
unrestricted deterministic decision branching programs verifying several fundamental operations. These 
results are derived making use of deep new connections between the properties of certain algebraic codes 
and fundamental algorithms. 
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